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- The MAILING DATE of this communication appears on the cover sheet with the correspondence address - 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )H Responsive to cx)mmunication(s) filed on 14 September 2004 . 
2a)D This action is FINAL. 2b)H This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 

Disposition of Claims 

4) S Claim(s) 1-26 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

6) |3 Claim(s) 1-26 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) Q Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10)D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
11 )□ The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 119 

12)D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f). 
a)D All b)D Some * c)D None of: 

1 .□ Certified copies of the priority documents have been received. 

2.D Certified copies of the priority documents have been received in Application No. . 



3.D Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
See the attached detailed Office action for a list of the certified copies not received. 



Attachment(s) 

1) CH Notice of References Cited (PTO-892) 4) EH Interview Summary (PTO-413) 

2) □ Notice of Draftsperson's Patent Drawing Review (PTO-948) Paper No(s)/Mail Date. . 

3) □ Information Disclosure Statement(s) (PTO-1449 or PTO/SB/08) 5) □ Notice of Informal Patent Application (PTO-152) 

Paper No(s)/Mail Date . 6) □ Other: . 



U.S. Patent and Trademark Office 
PTOL-326 (Rev. 1-04) 
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DETAILED ACTION 

L This action is responsive to communications: The RCE filled on 09/14/04 regarding the 
Affidavit and Remarks filled 07/09/04. 

2. Claims 1-9 and 12-24 remain rejected under 35 U.S.C. 103(a) as being unpatentable over 
Brown et al (US: 5,913,208, 06/15/99). 

3. Claims 10-11 and 25-26 remain rejected under 35 U.S.C. 103(a) as being unpatentable 
over Brown et al (US: 5,913,208, 06/15/99) in view of Microsoft Press Computer Dictionary, 
Microsoft Press, 1997, pp. 309. 

Swearing Back of Reference — Affidavit or Declaration Under 37 CFR 1.131 

4. The declaration under 37 CFR 1.131 filed Jun 7, 2004 is insufficient to overcome the 
rejection of claims 1-26 over Brown under 35 USC § 103 as set forth in the last Office action 
03/29/04 because it is not properly executed and fails to establish reduction to practice prior to 
the date of the reference. 

A. Formalities : The Affidavit is not signed by all inventors: 
The following parties may make an affidavit or declaration under 37 CFR 1 . 13 1 : A party 
qualified under 37 CFR 1.42, 1.43, or 1.47 in situation where some or all of the inventors are not 
available or not capable of joining in the filing of the application (MPEP §715.04). 

Under 37 CFR 1.47, filing when an inventor refuses to sign or cannot be reached: 
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If a joint inventor refuses to join in an application for patent or cannot be found or reached after 
diligent effort, the application may be made by the other inventor on behalf of himself or herself 
and the nonsigning inventor. The oath or declaration in such an application must be accompanied 
by a petition including proof of the pertinent facts, the fee set forth in § 1 .17(h), and the last 
known address of the nonsigning inventor. The nonsigning inventor may subsequently join in the 
application by filing an oath or declaration complying with § 1.63, 

Applicant has not met the criteria above. No petition under Rule 47 has been granted in 
this application. 

B. Reduction to Practice 

Applicant attempts to establish prior invention by showing reduction to practice of the 
invention prior to the publication date of Brown et al (June 15, 1999). In paragraph 2 of the 
affidavit applicant states that the invention "was reduced to writing in the United States prior to 
the June 15, 1999 issue date of the Brown reference." 

A written description does not constitute an actual reduction to practice. Furthermore, 
only the filing of a US patent application which complies with the disclosure requirement of 35 
USC § 1 12 constitutes a constructive reduction to practice. A written description, no matter how 
complete, which has not been made the subject of a US patent application does not qualify as 
reduction to practice. 

Accordingly, applicant has not established prior invention . The rejection is maintained as 
repeated below. 
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Claim Rejections - 35 USC§103 

5. The following is a quotation of 35 U.S.C 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

6. Claims 1-9 and 12-24 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Brown et al (US: 5,913,208, 06/15/99). 

-As per independent claim 1, Brown et al teach a method for identifying duplicate 
documents, where having received said documents generating a hit-list (Fig. 3b) of intrinsic and 
non intrinsic attributes (title, author, creation date, length, size, location, abstract, etc (columns 1 
and 6, lines 64-67 and 51-54)) wherein the table structure (equivalent to metadata summary sub- 
trees) of the attributes of each document pair are compared (i.e. if one document is missing an 
attribute structure then the documents are considered distinct) and each document is shown to be 
distinct from the other if their respective attributes aren't equal (Fig. 4: 430 & 450). Brown et al 
do not disclose that the summary hit-list contain metadata. It was well known in the art that 
document attributes such as title, size, location, etc are considered metadata of a document. 
Brown et al also do not disclose that the summary hit-list includes a sub-tree structure. It would 
have been obvious to one of ordinary skill in the art, to have used Brown et al method of 
comparing hit-lists of attributes and used the structural relations of those hit-list attributes as the 
first determinate of distinct documents, because while both methods provide the same benefit of 
identifying distinct documents, by checking the structure (layout) of those attributes first, it can 
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more quickly be established with the minimum amount of comparisons that the compared 
documents are distinct of one another (i.e. both documents have structures A & B, but since one 
has A before B and the other B before A they are considered distinct.) 

-As per dependent claim 2, Brown et al further disclose wherein the table structure 
contains intrinsic and non intrinsic attributes (title, author, creation date, length, size, location, 
abstract, etc (columns 1 and 6, lines 64-67 and 51-54)) to be compared. 

-As per dependent claim 3, Brown et al further disclose that one of the attributes used for 
determination could be an abstract or a title (text-content)(column 6, lines 51-54). 

-As per dependent claim 4, Brown et al further disclose and that one of the attributes used 
for determination could be an abstract or a title (text-content)(column 6, lines 51-54). 

-As per dependent claim 5, Brown et al further disclose noting the current pair of hit- list 
documents as duplicates if the text content are equal (i.e. the structure and non-text attributes 
have been compared but alone can not determine duplicates) (Fig. 4: 460). 

-As per dependent claim 6, Brown et al further disclose one embodiment of deleting 
duplicates from the hit list group (column 8, lines 20-22: Fig. 8). 
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-As per dependent claims 7-9, Brown et al further disclose that in the hit-list (metadata 

table) when an entry is identified as having one or more duplicates is cross referenced in the 

duplicate identifier field of the other duplicate (column 8, lines 1 1-16), wherein for two metadata 
entries the entry number and duplicate identifier field constitute a 2x2 matrix (Fig. 3b). Brown 
et al also teach that the process of identifying duplicates as distinct is achieved by comparing the 
metadata structures, attributes, and text content of the attributes as seen in claims 1-3. Brown et 
al do not teach storing a zero binary value in the first row and second column position of the 
matrix to designate that the first and second documents are distinct. It would have been obvious 
to one of ordinary skill in the art, to have used a binary indicator (0 or 1) in Brown et al duplicate 
identifier position concerning only two documents, because while Brown et al is set up for a 
plurality of documents which could lead to many multiple dependencies, if a restriction of only 
comparing two documents was applied to Brown et al, there would be no need for more than one 
indicator (0 or 1) to clearly state the relationship between the first document and second 
document. 

-As per independent claim 12, Brown et al teach a system wherein a search engine 
process produces a hit-list (metadata parser) (Abstract), where it has already been stated above 
that the hit- list contains metadata (i.e. structure, attributes, and some text content), wherein a hit- 
list formatter processor selects one or more of. duplicate instances of one another (summary 

consolidator)(Abstract) with the ability of the formatter processor to delete duplicates from he hit 
list group (repository) (column 8, lines 20-22: Fig. 8). Brown et al also teach wherein the 
generated summary hit-lists are stored (in a repository) (Fig. 1 and Fig. 4: 410) 
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-As per dependent claim 13, Brown et al further teach that the formatter processor 
(summary consolidator)(Fig.4: 420 & 440) is configured to compare the metadata attributes 
values of the hit- lists (metadata summaries) and compare the text content (i.e. abstract or title) 
included in the metadata attribute hit-lists. As shown in claim 1, Brown et al does not show that 
its formatter is configured to compare the structures of the metadata hit lists. It would have been 
obvious to one of ordinary skill in the art, to have used Brown et al method of comparing hit- lists 
of attributes and used the structural relations of those hit-list attributes as the first determinate of 
distinct documents, because while both methods provide the same benefit of identifying distinct 
documents, by checking the structure (layout) of those attributes first, it can more quickly be 
established with the minimum amount of comparisons that the compared documents are distinct 
of one another (i.e. both documents have structures A & B, but since one has A before B and the 
other B before A they are considered distinct.) 

-As per dependent claims 14-16, Brown et al further teach wherein the system is 
configured to compare the attribute values included in the hit-lists, the text content included in 
the metadata hit- lists, and the metadata portion of the hit-list. 

-As per independent claim 17 and dependent claims 18 and 19, Brown et al teach the 
method of classifying electronically posted documents of claim 1-3 respectively. Brown et al 
does not teach that the method is a program product to be executed by a computer system stored 
in a computer readable medium. It was well known in the art to convert a method for use by a 
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computer system to a program product so that that method can be executed on said system and 
any other systems that can execute the program. It was also well known in the art that a 
computer system for executing said program product would have a recordable media (memory). 

-As per dependent claim 20, Brown et al further teach identifying documents as 
duplicates if text content of the metadata hit-lists (i.e. abstract or title) are equivalent (i.e. the 
structure and non-text attributes have been compared but alone can not determine duplicates) 
(Fig. 4: 460). 

-As per dependent claim 21, Brown et al further teach removing the duplicate metadata 
hit-list from the group (column 8, lines 20-22; Fig. 8). 

-As per dependent claims 22-24, Brown et al further teach the hit-list (metadata table) when an 

entry is identified as having one or more duplicates is cross referenced in the duplicate 

identifier field of the other duplicate (column 8, lines 1 1-16), wherein for two metadata entries 
the entry number and duplicate identifier field constitute a 2x2 matrix (Fig. 3b). Brown et al also 
teach that the process of identifying duplicates as distinct is achieved by comparing the metadata 
structures, attributes, and text content of the attributes as seen in claims 1-3. Brown et al do not 
teach storing a zero binary value in the first row and second column position of the matrix to 
designate that the first and second documents are distinct. It would have been obvious to one of 
ordinary skill in the art, to have used a binary indicator (0 or 1) in Brown et al duplicate identifier 
position concerning only two documents, because while Brown et al is set up for a plurality of 



Application/Control Number: 09/513,058 Page 9 

Art Unit: 2178 

documents which could lead to many multiple dependencies, if a restriction of only comparing 
two documents was applied to Brown et al, there would be no need for more than one indicator 
(0 or 1) to clearly state the relationship between the first document and second document. 

7. Claims 10-1 1 and 25-26 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Brown et al (US: 5,913,208, 06/15/99) in view of Microsoft Press Computer Dictionary, 
Microsoft Press, 1997, pp. 309. 

-As per independent claim 10, Brown et al teach the method of claim 1 as stated above, 
and further disclose receiving not just two, but two or more entries (column 3, lines 54-55) of 
documents and generating metadata hit-list summaries for each received document. Brown et al 
do not teach that when receiving a plurality of documents to group the metadata hit-list 
summaries according to the mime-type designation of each document. The Microsoft Press 
Computer Dictionary teaches us that MIME-types describe the contents of a document which can 
be used to interpret the content of a file over the internet (pp. 309). It would have been obvious 
to one of ordinary skill in the art, to have used Brown et al method for identifying duplicate 
documents from search results without comparing document content and grouping the hit-list 
summaries by MIME-type, because documents of different MIME-types wouldn't have to be 
compared as their intrinsic attributes would be wholly different, i.e. a regular text/plain MIME- 
type couldn't be a duplicate of a text/html MIME-type, and thus this would increase efficiency 
by significantly reducing the number of hit- list summary comparisons. 
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-As per dependent claim 1 1 . Brown et al and the Microsoft Press Computer Dictionary 
do not teach grouping more subsets of hit- list summaries by MIME-type. As it was taught above 
in claim 10, it would have been obvious to have grouped as many MIME-type subsets as were 
required by the plurality of documents for the purpose of increased efficiency by significantly 
reducing the number of hit-list summary comparisons across MIME-types. 

-As per independent claim 25, Brown et al teach the method for classifying electronically 
posted documents of claim 10 as stated above. Brown et al do not teach that the method is a 
program product to be executed by a computer system stored in a computer readable medium. It 
was well known in the art to convert a method for use by a computer system to a program 
product so that that method can be executed on said system and any other systems that can 
execute the program. It was also well known in the art that a computer system for executing 
said program product would have a recordable media (memory). 

-As per dependent claim 26. Brown et al does not teach grouping more subsets of hit- list 
summaries by MIME-type. As it was taught above in claim 25, it would have been obvious to 
have grouped as many MIME-type subsets as were required by the plurality of documents for the 
purpose of increased efficiency by significantly reducing the number of hit-list summary 
comparisons across MIME-types. 



Response to Arguments 
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8. Applicants arguments filed 07/09/04 have been fully considered but they are not 
persuasive. As discussed above, the Affidavit is not properly executed and fails to establish 
reduction to practice prior to the date of the reference. Applicant's arguments rely on the 
Affidavit establishing a date prior to that of the relied upon reference and are thus moot. 



Conclusion 

9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Adam L Basehoar whose telephone number is (571)-272-4121. x 
The examiner can normally be reached on M-F: 7:00am - 4:00pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Steve Hong can be reached on (703) 308-5465. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 




